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Abstract 

£N| | A classical result in undirected wireline networks is the near optimality of routing (flow) for multiple- 

unicast traffic (multiple sources communicating independent messages to multiple destinations): the min 
cut upper bound is within a logarithmic factor of the number of sources of the max flow. In this paper we 
"extend" the wireline result to the wireless context. 
5/5 ' Our main result is the approximate optimality of a simple layering principle: local physical-layer 

O . schemes combined with global routing. We use the reciprocity of the wireless channel critically in this 

result. Our formal result is in the context of channel models for which "good" local schemes, that achieve 
the cut-set bound, exist (such as Gaussian MAC and broadcast channels, broadcast erasure networks, fast 
t> ' fading Gaussian networks). 

Layered architectures, common in the engineering-design of wireless networks, can have near-optimal 
| performance if the locality over which physical-layer schemes should operate is carefully designed. Feed- 

^-j- . back is shown to play a critical role in enabling the separation between the physical and the network layers. 

The key technical idea is the modeling of a wireless network by an undirected "polymatroidal" network, 
for which we establish a max-flow min-cut approximation theorem. 

1 Introduction 

■ i 

Wireless networks are typically engineered using a layered approach: the physical layer deals with the channel 
noise, the medium access control layer deals with scheduling of users in the wireless context, the network 
layer handles routing of information and the transport layer deals with network congestion. While this design 
methodology has several engineering advantages that have led to the proliferation of wireless networks, a 
fundamental understanding of layering architectures is still lacking. 

In this paper, we look explicitly for layered communication strategies that are near optimal for multiple 
unicast in general wireless networks. This would serve two (distinct) objectives: 

• To obtain the approximate capacity of multiple unicast in wireless networks, and 

• To establish a layered communication architecture that can guide engineering design. 
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1.1 Prior Work 



Fundamental understanding of layering architectures has recently received plenty of attention from the net- 
working community |[55l 11541 , and scenarios have been identified under which a joint optimization of the 
transport and network layers naturally decompose into separate optimization problems, thus yielding a justi- 
fication for the layered architecture. While there have been attempts to include certain aspects of the wireless 
medium into this framework |[66l . the understanding is far from complete. In this paper, we take a fundamen- 
tal, information theoretic perspective, on if and when, the physical, medium access and network layers can be 
separately designed. 

1.1.1 Capacity Results for Wireless Networks 

Substantial progress has been made in the recent past in understanding the key aspects of the wireless medium 
(broadcast and superposition) from an information-theoretic view point. In particular, the capacity of MIMO 
broadcast channel has been resolved [221, approximate capacity of the 2-user interference channel has been 
established |[2T1 and the approximate capacity of 2-user X-channels l59l ||39l , if -user interference channels 
with diversity IPT41 . ll65l Il24ll64l has been obtained. While these results establish information theoretic under- 
standing of several important ("physical layer") channels, there is no conceptual guideline to fit the solutions 
for reliable communication for the channel in the context of a bigger network it could be a part of. In a 
different direction, there has been significant progress in understanding network-level capacity issues in the 
context of simple traffic models, starting from the breakthrough work lfl2l . where the approximate capacity 
of single unicast is characterized, and later generalized to several scenarios: the approximate capacity of uni- 
cast in discrete memory less networks is characterized in lIBTl . a separation result between analog and digital 
components in relay networks is established in ifTOl and the approximate capacity of broadcasting in Gaussian 
networks is established in (21 . 

While unicast traffic in general Gaussian networks and multiple unicast traffic in single-hop Gaussian 
networks are reasonably well understood, the capacity of multiple unicast traffic in Gaussian wireless net- 
works remains an open problem in multi-terminal information theory. In recent times, several research groups 
have made progress on this problem ll80l l63l l40l l84l [TT1 l34l , but the general problem still remains unsolved. 
Specific directions, with promise of success, involve simplifying the problem by considering specific traffic 
patterns such as 2-unicast ll63l l80l l47l [90l [9T1 |4T1 18T1 l32l l86l ; another approach is to consider more specific 
network topologies, like for example, K sources communicating to K sinks via L fully-connected layers of 
K relays each BOllTOl . While these existing works attempt to compute the degrees-of-freedom (or approx- 
imate capacity) exactly for specific instances of the problem, we adopt a different viewpoint and focus our 
attention on obtaining general results for arbitrary networks (at the expense of obtaining potentially weaker 
approximation in specific instances). 

In the context of multiple-unicast in large wireless networks, there has been significant progress in un- 
derstanding scaling laws for geographical wireless networks; beginning with the seminal work in ll33l and 
culminating in the hierarchical relaying scheme in f73l and a combination of the two [68 , 69l|7H (with several 
critical works in between ||95ll56i r7l l94ll27l ). Despite its significant advantages, the performance guarantees 
are only in the context of certain specific wireless network models and, more importantly, the communication 
scheme is not a representation of a simple layered architecture for communication. 

1.1.2 Information-theoretic Layering Architectures 

Separation theorems form a basic tool in information theory: in his celebrated paper 1791 , Shannon showed 
that source coding (compression) and channel coding (communication) can be separated without loss of op- 
timality. Following this, several separation (and non-separation) theorems have been proved in the multi- 
terminal context (see, for example, lfT8ll20ll ). The result most relevant to the current discussion is the sepa- 
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ration between network coding and channel coding proved in the pioneering work in B31 . There it is shown 
that, for a wireline network composed of independent noisy channels, a separation architecture composed of 
a physical layer that performs independent coding for each channel and a network layer which transports bits 
across the induced noiseless network, is optimal. This is a very interesting structural result that holds under 
arbitrary traffic models and is proved without the necessity to compute the capacity of the network. Thus the 
question of studying the capacity of wireline networks can be reduced to the question of studying the capacity 
of capacitated graphs. 

For &-unicast in wireline graphs, a very interesting dichotomy is known in the theoretical computer science 
literature: for undirected graphs, the classical work of Leighton and Rao ll50l shows that routing achieves the 
min-cut to within a C(log k) factor and furthermore, there is a standing conjecture ll52l ll35l ll43l ll48l which 
claims that routing is infact optimal; whereas for general directed graphs, it has been shown recently |[T6l 
that there is no polynomial time algorithm that can approximate the value of min-cut to within a k e factor. 
Since max-flow can be computed in polynomial time, this result implies that there are networks for which 
flow cut gap is greater than k e . Furthermore, it has been proved recently |[T5l that computing the network 
coding region for directed graphs is equivalent to determining the entropy vector region, which is believed to 
be a very hard problem. Thus the string of positive results in the context of undirected (or bidirected) graphs 
and negative results in the context of directed graphs serve as an indicator that it may be easier to understand 
bidirected wireless networks. 

A study of layering in directed wireless networks is intiated in ll46l . The key idea is that of channel 
emulation, where a given channel is upper bounded by a wireline network with joint capacity constraints in 
such a way that the wireline network can emulate all possible behavior of the channel. This is a very strong 
condition, which ensures that the channel can be upper bounded by the wireline network irrespective of the 
traffic pattern. This program has been already accomplished for networks of 2-user MAC and broadcast 
channels, but appears to be very hard for general networks. In this paper, we instead work with only multiple- 
unicast traffic but go on to study layering in general bidirected networks. 

1.1.3 Polymatroidal Networks 

Polymatroidal networks and approximation theorems for these networks form a crictical component in this 
paper. Polymatroidal networks are similar to edge capacitated networks, however, in addition to having 
edge capacity constraints, there are joint capacity constraints on the set of edges which meet at a vertex. 
Polymatroidal networks were introduced by Lawler and Martel |49] and Hassin ll36l in the single-commodity 
setting and are closely related to the submodular flow model of Edmonds and Giles ll23l ; the well-known 
maxflow-mincut theorem holds in this more general setting (see Chapter 60 in 11771 for a discussion). Multiple 
unicast in polymatroidal networks has not been studied till the recent work of Q, where an approximate 
max-flow min-cut result is established for undirected polymatroidal networks. 

Recently, there have been several applications of network flows, and their generalizations such as flows 
in linking systems f78l . to information flow in wireless networks lTT2l [9l [3TI P761 l42ll - The polymatroidal 
structure of the multiple access channel capacity region was observed and exploited by Tse and Hanly ll82l . 
Directed polymatroidal networks were utlized in the work of Vasudevan and Korada [85], where a separation 
architecture for a network of deterministic broadcast and MAC channels converts the network into a poly- 
matroidal network and existing results for broadcasting in polymatroidal networks |[25l are used to obtain 
capacity bounds. 

1.2 Summary of Results 

An important contribution of this paper is the following meta-theorem: 
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Meta Theorem: If there is a "good" physical-layer scheme (which is approximately cut-set 
achieving and reciprocal) for a certain channel, then, for multiple-unicast in a network com- 
posed of such channels, a layered architecture is approximately cut-set achieving ( to within a 
logarithmic factor in the number of messages). 




Network layer: Routing 



Analysis 



Cut equivalence 



Figure 1 : Layered Architecture 

Fig. Q] depicts the layered architecture used in this paper. The following is a summary of the key ideas 
which are used in this paper to argue that a layered architecture is approximately capacity optimal: 

• Model a wireless network as abidirected network, by using the natural reciprocity of wireless networks. 

• Utilize a good local "physical layer" scheme for each channel and identify the combinatorial structure 
of the rate region (typically submodularity). 

• Show that local physical layer schemes convert a wireless network into a bidirected polymatroidal 
network. Thus the bidirected polymatroidal network can be viewed as a graphical model for wireless 
networks. 

• Prove a Leighton-Rao type approximation result for bidirected polymatroidal network, which shows 
that routing is near optimal for /c-unicast traffic. 

• Argue that the layered architecture with local physical layer scheme + global routing achieves the cut- 
set approximately in the wireless network. 

• We provide a technique by which "good" results for a given channel can be lifted up to good results for 
a general network comprised of those channels. 

We justify the meta-thereom formally in the context of the following channel models: 
1. Networks composed of Gaussian broadcast and MAC channels 
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2. Networks composed of broadcast erasure channels with feedback 

3. Fast fading wireless networks 

4. Degrees-of-freedom approximation for fixed wireless networks 

5. Linear deterministic networks composed of MAC and broadcast channels 

6. Networks composed of MIMO MAC and broadcast channels with delayed CSI feedback 

7. Fast fading linear deterministic networks 

For each of these networks, under a general £>unicast traffic model, the approximation factor on the rate 
is C(log k) for the entire rate region in addition to the loss incurred due to the physical layer scheme, which 
is typically a power-scaling loss. Under more specific traffic models, such as the X-traffic model (where each 
of the J sources have messages to send to each of the K destinations) or a "group-communication" traffic 
model (where a subset S of nodes have messages to send to each other), we prove a constant approximation 
factor for the sum-rate, again in addition to a power scaling loss (the constant being 1, 2 or 4 depending on 
the specific channel and traffic model). 

The material in this paper was presented in part in (4) HI. 

1.3 Organization 

The rest of this document is organized as follows: 

• An overview of the layering approach is provided in Sec. |2] After defining the "locality" over which 
local physical layer schemes must be implemented, a list of desirable properties of local solutions are 
provided. 

• In Sec. [3j polymatroidal networks, which form the back-bone of the layering architecture, are defined 
and results for multiple-unicast in polymatroidal networks are presented. 

• In Sec. 01 "good" local physical layer solutions are described for various channel models. For some 
models, it is shown that existing schemes satisfy the desirable properties, whereas for other models, 
where existing schemes are insufficient, new ones are constructed. 

• In Sec. [5J the local schemes are fitted into a global network context. Capacity theorems are proved for 
the various channel settings by connecting the wireless network problem formally to the polymatroidal 
network problem. 

2 Layered Architecture 

Engineering approaches to reliable network communication involve "layering", a separation of the roles of 
physical (dealing with channel uncertainty), medium access (dealing with sharing the wireless medium) and 
networking (dealing with end-to-end the resulting "wireline" network communication). On the other hand, 
fundamental architectures are suggested by information theoretic study of large wireless networks (a major 
research direction in the past decade, with performance measured in a coarse scaling context). For instance, 
multihop routing ll33l is a layered architecture, while hierarchical MIMO f73l (nearly scaling-law optimal in 
a geographically uniform context) is not. The information theoretic understanding of layering architectures 
has recently started receiving attention (see I45ll46l ). Our approach is in similar lines as the approach in (85j, 
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where a layered architecture for a network of deterministic broadcast and MAC channels is used to obtain 
capacity bounds. 

In understanding the systematic design of layered architectures, it helps to look at the global wireless 
network as a collection of "local" wireless networks. The focus of this section is to introduce this view point; 
we propose that the notion of locality comes from both geographic (spatial) and temporal contexts. We see 
that certain combinatorial properties of the (physical layer) solutions to the local networks are desirable; these 
will help prove fundamental guarantees on the performance of the layered architecture in the global context. 

2.1 Locality 

A wireless network is a collection of local channels, if there are no interactions between the channels. For- 
mally, a wireless network composed is defined as follows: Consider a graph G = (V, E). For each v € V, 
x v ,y v denote the transmit and received symbol respectively. Let C + (v), C~(v) denote the set of all channels 
in which v can transmit and receive from respectively, i.e., x v can be written as x v = {x c v } c£ c+(v)> an d 
Vv — {Dv}cec~{v)- We will consider a wireless network with independent noise, where 

) = Tlv^(yv\{x u }u:{u,v)£E), (!) 

and this description explicitly captures the relationship between the graph and the joint probability transition 
function. 

Consider a set of channels c G C. A wireless network is said to be composed of channels c € C if the 
probabilistic description of the network is of the form, 

) = IL c F({y c v } veV -, c) \{x c u } 

ngV+fc )) (2) 

= n c n^ Gl/ -( C )P(y^|{x^} ue i/+( C ).( Ui „) g£; ), (3) 

where V~(c) = {v : c S C~(v)} and V + (c) = {v : c G C + (v)}. Each channel c is referred to as a 
component channel of the network. 

A canonical scenario occurs when the wireless network is simply a collection of statistically independent 
noisy channels. Here each channel between a transmitter and a receiver is local. A more interesting example 
occurs in the case of a. frequency planned wireless network, where each component of the wireless network 
operates in a specified frequency range. Here, the overall channel model can be decomposed as the product 
of channel models in each frequency range; the scale of locality corresponds to the scale of frequency reuse. 

In general, such a geographic decomposition (via frequency planning) may not happen. Nevertheless, we 
can view the decomposition as occurring in time (indeed, this has been a popular method for analyzing general 
wireline / wireless networks (6j[l2l). When we decompose across time, the local channel corresponds to the 
global one, as viewed over a specific single block of time. In this context, the layering architecture restricts 
the sophistication of physical layer (and medium access layer) strategies to be restricted to operate on a single 
layer in time, and at the end of each epoch, the information is decoded and re-encoded (using the networking 
layer) for the next local channel. The layering architecture thus enforces decoding of all information at each 
"hop" (in time); schemes, such as quantize-and-forward lfT2ll . which forward analog information do not fit the 
layered architecture model. 

2.2 Desirable Properties of Local Solutions 

A natural desirable property of any (physical layer) solution to a local channel is it be as optimal (from an 
information theoretic view point) as possible. In particular, we will be interested in how close the solution 
is to fundamental upper bounds given by the cutset bounds and certain natural combinatorial properties of 
the solution. For a network described by a probability transition matrix, the cut-set bound can be written as 
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follows. Given a cut Q C V, let Dq be the set of demands separated by the cut, i.e., Dq := {k : Sk € ifc £ 
Sl c }. The cut-set bound bounds the sum of rates of sources in Dq and can be written asQ 

]T R k < sup l(X n ;Y nc \X U c). (4) 

Our focus on cut-set bounds as opposed to specialized outer bounds for specific wireless channels (such 
as the broadcast and interference channels) is motivated due to the following reasons. 



• Generality: The cut-set bound [28 ] is an information theoretic outer bound on the achievable rate region 
and it can be written down for a general wireless network. 

• Decomposition: The chain rule of mutual information allows the cut-set bound of a network to de- 
compose into the cut-set bounds on local channels; thus solutions that come close to the cut-set bound 
at a local level have a potential to be layered and be still close to the cut-set bound at a global level. 
Formally, if we have a cut (ley, the value of the cut is given by 

l(X n ;Y nc \X n c) = H(Y Q c\X n c) - H(Y Q c\X v ) (5) 
= H(Y ac \X nc )-J2H(Y£ c \X^) (6) 

c 

< J2H(Y£ c \X Q c)-H(Y U c\X v ) (7) 

c 

< ^2u(Y^\X^)-H(Y nc \X v ) (8) 

c 

= ^H(y n c c |^ c )-H(Y n c|X£) (9) 

c 

= ^ip^ ; y n c c|^ c ), do) 

c 

and thus the cut-set decomposes into sum of the cut-sets evaluated for each channel. 

• Structure: Cut-set bounds have been well studied in the theoretical computer science literature and their 
combinatorial structure have been well understood. In fact, algorithms for approximately computing 
the cut-set bounds form an integral part of the theory of approximation algorithms. 

• Invariance under feedback: The cut-set bound (evaluated under general joint distributions) is essentially 
a bound based on upper bounding the rate by the rate of a point-to-point channels and is, therefore, 
invariant to feedback. 

Finally, reciprocity of the local channels (rate region reciprocity with the roles of transmitter and receiver 
reversed) will be paid attention to. The combinatorial structure imposed by the bidirected nature of each local 
channel will yield to efficient algorithms that are close to cuts. 

2.3 Layering Methodology 

Layering architectures stitch together the local solutions into a global solution: 

1. The solution to a local channel allows for reliable digital communication at a local level. 



While there is a stronger way of writing this bound, this weaker form of the bound will suffice for the purposes here. 
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2. Replacing each local channel by a set of (wireline) links leads to a network comprised of noiseless 
channels, with the rates on the various edges are coupled by the rate regions of the local solutions. 
Reciprocal local solutions ensure that the network obtained is bidirected (i.e., any edge between node a 
to node b has a corresponding edge between node b to a, with the two edges being involved in the same 
types of capacity region constraints). 

3. Over the resulting wireline network, we might have to potentially employ network coding to re-encode 
the information between local channels. We utilize the combinatorial properties of the coupled rate 
constraints to study these new class of wireline networks. In particular, if the combinatorial struc- 
ture governing the rate constraints is a specific form of a polytope known as a polymatroid, we obtain 
polymatroidal networks. Therefore, we study polymatroidal networks (which have local polymatroidal 
constraints on rate region) and prove that routing can achieve the cut-set bound to within a O(logfc) 
factor for the A;-unicast problem, and also prove some better approximations for more specific commu- 
nication problems. 

4. Since the cut-set bound on a network of channels decomposes into a sum of cut-set bounds on the local 
channels, we can readily compare the performance of the layering architecture to a fundamental upper 
bound on the global network performance. 

Whenever local solutions are close to the cut-set bounds for the corresponding local channels, we can 
establish the fundamental near-optimality of the layering architecture. We have accomplished this program for 
several canonical local wireless channels including broadcast erasure channels, Gaussian uplink and downlink 
channels, and interference channels with diversity (example: fast fading). 

3 Polymatroidal Networks 

Polymatroidal capacity networks and max-flow min-cut results for polymatroidal networks form the back- 
bone of this paper. In this section, we give a brief introduction to polymatroidal networks. A detailed discus- 
sion of polymatroidal networks and the max-flow min-cut approximation result can be found in (T). 

3.1 Polymatroids 

A set function / : 2 N -> R over a finite ground set N is submodular iff f(A) + f(B) > f (An B) + f (All B) 
for all A, B C N; equivalently f(A U {i}) - f(A) > f(B U {»}) - f(B) for all A C B and % A. It is 
monotone if f(A) < f(B) for all A C B. A polymatroid refers to the following set in R^: 



where f(S) is a monotone submodular function with /(0) = 0. Thus a polymatroid is fully specified by spec- 
ifying a monotone submodular function with /(0) = (we will call such a function itself as a polymatroid). 
An example of a polymatroid is the following: given a set of N vectors v±, ...,vn in R M , the function f(S) 
denned as the rank of the matrix composed of {vi}i & s defines a polymatroid (we refer the reader to 11711 for 
an introduction on polymatroids) . 

3.2 Definition of Polymatroidal Networks 

A commonly studied wireline scenario is one where each edge is labeled by a capacity: this is the largest 
amount of information flowing on that edge. Here we are interested in a more general model which is able to 




(11) 
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handle the additional constraints when edges meet at a node, similar in spirit to the broadcast and supeiposition 
constraints in wireless. 

Consider a node v in a directed graph G and let 5q{v) be the set of edges in to v and 5q(v) be the set of 
edges out of v. In the standard model each edge (u, v) has a non-negative capacity c(it, v) that is independent 
of other edges. In the polymatroidal network for each node v there are two associated submodular functions: 
p~ and p+ which impose joint capacity constraints on the edges in Sq(v) and Sq(v) respectively. That is, for 
any set of edges S C Sq(v), the total capacity available on the edges in S is constrained to be at most p~(S), 
similarly for 5q(v). Note that an edge (u, v) is influenced by p+ and p~. 

A cut SlCF partitions the set of vertices into two sets £1 and Sl c . Let F(Q) denote the set of edges going 
from SI to f2 c (we will drop the dependence on Q if it obvious from the context). The cost/capacity of F is 
complex to define in polymatroidal networks (note that in standard networks it is simply ^ e£F c(e) where 
c(e) is the capacity of e). To define the cost in polymatroidal networks, each edge (it, v) in F is first assigned 
to either it or v; we say that an assignment of edges to nodes g : F — s> V is valid if it satisfies this restriction. 
A valid assignment partitions F into sets {g' 1 ^) | v G V} where g~ l {v) (the pre-image of v) is the set of 
edges in F assigned to v by g. Then we minimize over all assignments. 

Definition 1. Cost of edge cut: Given a directed polymatroidal network G = (V, E) and a set of edges 
F C E, its cost denoted by v{F) is mm g , F ^. v ,g valid J2 V {Pv ( 5 ~ ( v ) n (v)) + pt ($ + (v) C\ g' 1 (v))) . In 
an undirected polymatroid network u(F) is mm g: F-tv, g valid Ylv Pv i.9" 1 ( v ) )■ 

A max-flow min-cut theorem for the unicast problem in directed polymatroidal networks is known in 
the literature ll49l . This result has been extended to the broadcast traffic scenario in ll25l . Our focus here 
is on multiple unicast traffic, for which flow-cut gaps are currently unknown. The best known result for 
multiple-unicast, even in the traditional wireline network with capacity constrained edges, is the approximate 
optimality of max flow (in terms of being close to the min cut); this result is available (in a strong sense) only 
for bidirected (or undirected) networks. Our goal is to obtain a similar result for bidirected polymatroidal 
networks. 

We define a bidirected polymatroidal network as a directed polymatroidal network with the following 
properties. 

• Every edge e = has a corresponding reverse edge r(e) := 

• For any vertex v, the polymatroidal constraint p^(-) on the incoming edges In(i>) is the same as the 
polymatroidal constraint p^(-) on the outgoing edges Out(i>). More concretely, 

p-(E v ) = p+(r(E v )) VE v Cln(v). (12) 

3.3 Main Result 

The following theorem is proved in [:1|, which generalizes the results of ll50l to the case of polymatroidal 
capacity networks: 

Theorem 1. For a bidirected polymatroidal network with k source-destination pairs, the ratio between the 
max-flow rate region and the min-cut rate region is C(log k). The max-flow and an approximate min-cut 
can be calculated in polynomial time. Furthermore, this factor is tight in general, i.e., there are families of 
polymatroidal networks such that the flow-cut gap is fi(log k). 

Proof. (High level idea) The proof is done for undirected polymatroidal networks, whose capacity is within a 
factor 2 of bidirected capacity networks. The max-flow can be written down as a large linear program. While 
this program has an exponential number of constraints, a polynomial time algorithm can be obtained to solve 
this program using the polymatroidal structure. The dual of the linear program is related to the cut via an 
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integer relaxation. An important step in the proof is to simplify the constraint structure of the dual via the 
use of continuous extensions of submodular functions, in particular the Lovasz extension ll58l . The resulting 
program has a convex objective function but the constraint structure is much simpler. This convex program 
(which equals the dual of the flow) and the cut are related by an integer relaxation. 

The work of Linial, London and Rabinovich 1571 showed a fundamental (and tight) connection between 
flow-cut gaps in edge-capacitated undirected networks and low distortion embedding^ of finite metric spaces 
into ^™.This connection effectively reduced the flow-cut gap question to investigating £™ embeddings of finite 
metric spaces induced by graphs. 

For the polymatroidal network problem, embeddings into a general i\ spaces are no longer sufficient, and 
we require a specific type of embedding called line embedding, where the nodes are embedded into a line in 
E. These embeddings were implicit in the seminal work of Bourgain [13], first explored by Matousek and 
Rabinovich ll62l IT751 and later exploited in |[26l to obtain flow-cut gaps in undirected node-capacitated graphs. 
These results imply that every finite metric space of n elements can be embedded by a contraction into a line 
preserving the average distortion of k points to within a factor of C(log k). We use this result to connect the 
convex program written using the Lovasz extension to the cut, hence proving a flow-cut gap of C(log k). 

The converse part, i.e., the existence of families of polymatroidal networks with a gap of Q,{\ogk) fol- 
lows from the existence of families of edge capacitated networks (which are special cases of polymatroidal 
networks) with a flow-cut gap of 0(log k). 

□ 

3.4 Special Traffic Scenarios 

While in general, the factor of 0(logA;) for flow-cut gaps is tight for multiple-unicast in bidirected polyma- 
troidal networks, there may be special communication scenarios for multiple unicast when the factor can be 
improved. We present some instances here, where the flow cut gap is much better even for the more general 
case of directed polymatroidal networks. 

3.4.1 Broadcast traffic: 

Broadcast traffic is a special type of multiple unicast traffic where all the messages originate at a single 
source. Consider a directed polymatroidal network with a single source s having independent messages to K 
destination nodes t\,...,tK- 

Lemma 1. H25V For a directed polymatroidal network with broadcast-traffic pattern, the rate region of the 
max flow equals the rate region of the min-cut. 

3.4.2 Sum rate in directed X networks: 

Consider J sources S\,...,Sj and K destinations T\,...,Tk, where each source has an independent message 
for each destination. The rate tuple is a JK length vector Rjk between each j and k. This communication 
problem is referred to commonly as the X-network problem. 

Lemma 2. For a directed polymatroidal network with X -traffic pattern, the sum-rate of max flow equals the 
sum-rate bound given by min-cut. 

Proof. Construct a super source S which talks to the J sources with infinite capacity links and a super sink T 
which is connected from each of the K sinks via infinite capacity links. The max-flow min-cut theorem for 
unicast between S and T in directed polymatroidal networks |49l implies the desired result. □ 

2 An embedding of a metric space (V, d) (metric d on points V) into another metric space (V' , d') is a mapping / : V — ¥ V . The 
distortion of / is the smallest a > 1 such that — ■ d(u, v) < d'(f(u), f(v)) < a ■ d(u, v) for all u,v € V. 
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3.4.3 Sum rate for group communication in directed networks: 

Consider a directed polymatroidal network with a specially marked group of nodes S C V. Each node 
s in S has an independent message for every other node in S. Thus it is a multiple unicast problem with 
| .SI (151 — 1) messages. We refer to this traffic pattern as the group-communication traffic pattern. Suppose 
we are interested only in maximizing the sum-rate. 

Lemma 3. £7]/ For a directed polymatroidal network with a group-communication traffic pattern, the sum-rate 
ofmax-flow is greater than half the sum-rate bound given by min-cut. 

Proof. The proof of this theorem is non-trivial and requires a reduction from the directed polymatroidal net- 
work to the directed edge capacitated network using a combinatorial uncrossing argument. For a directed 
edge-capacitated network, this theorem is proved by Naor and Zosin lloTll . For a detailed proof of this state- 
ment for polymatroidal networks, we refer the reader to HI. □ 

4 Local Physical Layer Schemes 

In this section, good local physical layer schemes for several channel models will be discussed. For each of 
these channels, the goals will be to identify a physical layer scheme, quantify its rate region, understand its 
closeness to the cut-set bound and to examine its combinatorial structure. We will also analyze if the rate 
region remains (approximately) the same when the sources and destinations are exchanged and channels are 
reversed. In later sections, these properties will allow us to stitch together local physical layer schemes to get 
global schemes. The results in this section for various channel models are summarized in TableQ] 

We will use the notation TZ^ h and 7£* t to denote the achievable and the cut-set rate regions respectively, 
where ch denotes the channel of interest. 

4.1 Linear Deterministic Broadcast and Multiple Access Channels 

Consider a broadcast channel with d receivers of the form, 

yi = H iX , V* = l,2,...,d, (13) 

where yi is the received vector at receiver i and x is the transmitted vector. The source intends to communicate 
independent messages to each of its destinations. For a subset K of {1, 2, d}, let Hjc denote the matrix 
with Hi,i € K stacked up alongside one another. The capacity region of this broadcast channel ll60l is given 
by 

C = {{Ri, R d ) : Ri < Rank(F^) VA C [d]}. (14) 

ieK 

This capacity region is also equal to the cut-set bound, which is a polymatroid (see fTTI ). 

Let us consider a "reciprocal" multiple access channel in which there are d transmitters and one receiver, 

y = Y,Hfxi, (15) 

i 

and all the transmitters have an independent message to transmit to the single destination. The capacity region 
of a general MAC channel is known (see, for example, Chapter 14 in lfl7l ) and for the linear deterministic 
channel, is given again by the rate region in (fT4l . We observe that the capacity of the broadcast channel and 
the reciprocal MAC channel are the same and are equal to their cut-set bound. 

Thus for a linear deterministic broadcast and MAC channel, the rate region is exactly polymatroidal, equal 
to the cut, and is reciprocal. 
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Characteristic / 
Channel 


Closeness-to-Cut 


Combinatorial 
Structure 


Reciprocity 


Linear Deterministic MAC / BC 


Exact 


Polymatroidal 


Exact 


Gaussian MAC / BC 


Approximate 


Polymatroidal 


Approximate 


Erasure Broadcast 


Far 


Polytope 


Far 


Erasure Broadcast (Feedback) 


Approximate 


Polymatroidal 


Approximate 

t t 


Fading MAC / BC 
with delayed CSI 


Approximate 


Polymatroidal 


Approximate 


Fading linear deterministic 


Approximate 


Polymatroidal 


Approximate 


Fading X-Channel 


Approximate 


Polymatroidal 


Approximate 


Fixed X-Channel 


Approximate 
in DOF 


Polymatroidal 
DOF region 


Approximate 
in DOF 



Table 1: Canonical wireless channels, as viewed via three lenses. 



4.2 Gaussian Broadcast and Multiple Access Channels 

Let us first consider a multiple access channel, defined by 

y = S ^h i x i + z, (16) 

i 

where the transmitted vector Xi is constrained by a power constraint P at each of the d nodes, y is the received 
vector and z denotes the noise, which is of unit power. Let the rate region achievable on this multiple access 
channel be denoted by T^ C ^ C {P)- This region is known to be polymatroidal |[82l . This rate region equals the 
cut-set bound evaluated under product distributions ifTTl , i.e., 

^ach C (-P) = ^cutproduct(-f)- (^) 

Let the cut-set bound evaluated under general distributions be given by T^^g enem \{P)- We can easily verify 
the relation, 

^ach C (-P) ^ ^outgeneral (l[J ' 

Next, let us consider a "reciprocal" or "dual" broadcast channel, given by, 

yi = HiX + Zi, Vi = 1,2, ...,d, (19) 

where the transmitted vector x is constrained by a power constraint dP, yi is the received vector and Zi 
denotes the noise at each receiver, which is of unit power. Let us call the rate region of the broadcast channel 
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as TZ BC (P). This rate region has been fully characterized, but is not equal to the cut-set bound and is not 
polymatroidal (see Chapter 6 in f83l for a discussion). 

The rate region of this broadcast channel with sum power constraint Pd contains the rate region of the 
multiple access channel with individual constraint P at each node |[87l[88l . i.e., 

t^(p) 5 nX c {P)- (20) 

For purpose of symmetry, we can choose to operate the broadcast channel at the rate region specified by 
ft MAC (P),i.e.,let us set 

«0P) = « C (n (2D 

This will also ensure that the rate region of the achievable scheme Tl^(P) is polymatroidal. Thus, in our 
achievable strategy, the rate region of a multiple access and that of the dual broadcast channel are equal and 
given by a polymatroidal region. 

Let the cutset bound for the broadcast channel be specified as ^-fm general (^)- Since there is only one input 
variable, the cut-set bound under product distribution and general distribution are the same. 

Lemma 4. The achievable region of the MAC channel compares with the cutset bound under general distri- 
butions as follows, 

For the broadcast channel, we have the relation, 

ftach(P) 2 T^ut.general (f~rf^j ' (23) 

Proof. The proof is deferred to Appendix lAl □ 

Thus for a Gaussian broadcast and MAC channel, the rate region is approximately polymatroidal, close to 
the cut, and is approximately reciprocal. 



4.3 Broadcast Erasure Channels 

Consider a network comprised of broadcast erasure channels. For a broadcast erasure channel with d receivers, 
the channel model can be written as, 

Di = eix, Vz = 1, 2, d, (24) 

where e{ is a binary random variable which when represents that at receiver i, the packet got erased. If e; 
are all independent, then the broadcast erasure channel is said to be an independent erasure broadcast channel. 
Such a channel is specified by erasure probabilities i = 1, 2, d, where 

d = Pr{ ei = 0}. (25) 

For the purpose of simplicity in this paper we will consider only broadcast erasure channels that are indepen- 
dent and symmetric, which implies that there is only one parameter e and = e Vi. 

Lemma 5. For an erasure broadcast channel without feedback, the cut-set bound can be as large as L times 
the achievable rate region. 

Proof. See Appendix iBl □ 

This result implies that for broadcast erasure channels (without feedback), there are no good local schemes 
that can achieve close to the cut. 
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4.3.1 Broadcast Erasure Channels with Feedback 



Since there are no good local schemes for the broadcast channel, we suggest that the scale of the locality be 
enlarged to include the presence of feedback links in the physical model to get better local schemes. 

The capacity of the erasure broadcast channel with ACK feedback (the receiver acknowledges whether it 
received the packet) was studied by IT301 for the two user channel, and later extended to the more general case 
independently by ll89l and 11291 . The schemes are based on network coding and interference alignment and 
demonstrate that the following rate region is achievable. 



Lemma 6. I \89\ \29v The following rate region is achievable for the erasure broadcast channel with ACK 
feedback, 



R 



ach,Jb 



(Ri,...,R D )\ E rZ^^ 1 VttL 



(26) 



Here 7r is a permutation of the set (1, . . . , d). Note that this region is not polymatroidal. However, it is 
close to the min cut rate region (which is itself polymatroidal) as seen below: 

Lemma 7. 

Tlachjb 5 , nn cu j: ■ (27) 
O(logd) 

Proof See Appendix O □ 

The reciprocal nature of wireless channels from which the broadcast erasure channel is constructed nat- 
urally suggests a way of providing feedback links of commensurate strength. Formally: a channel is said to 
have commensurate feedback if there is are feedback links from the various receiving nodes to the transmit- 
ters with the same rate region as the cut-set bound for the forward channel. In Appendix 151 we look at one 
possible way of obtaining feedback links of commensurate strength as the forward links. 

Thus for a broadcast erasure channel with feedback, the rate region is approximately polymatroidal, close 
to the cut, and is approximately reciprocal. 

4.4 MIMO Broadcast and MAC Channel with Delayed Feedback 



We will consider MIMO broadcast and MAC channels, similar to Sec. 14.21 but with the difference that the 
channel states are i.i.d. over antennas and time. If the CSI is instantaneously available at the transmitter 
and the receiver, then the methods and results of Sec. 14.21 continue to hold. However, the problem becomes 
interesting when global channel state information is no longer available. In some settings, the channel may 
change so fast that by the time, the channel state feedback reaches the transmitter, the channel state has 
changed significantly. This feature is essentially present in the erasure channel if we think of erasure as being 
a channel state, which is unknown to the transmitter originally but the presence of the ACK feedback delivers 
this channel state to the transmitter with a delay. Recent work by Ali and Tse ['8j] showed the surprising result 
that even in Gaussian networks, delayed CSI can be very beneficial as compared to the absence of CSIT ll96l . 
This will form the basis of our investigation of these channels. 

We assume that each broadcast and MAC channel gets feedback from its receivers about the channel state, 
however, the channel changes before the feedback arrives, precluding the use of feedback to predict the future 
state of the channel. We will resort to a degree-of-freedom characterization for this problem. Let d{ denote 
the achievable degrees of freedom for the z-th message, i.e., 

SNR->oo log SNR 
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where Ri is the achievable rate for user i. The achievable DOF region is denoted by V a( ±, and the region 
given by the cutset bound is denoted by V cat . 

Such a multiple access case is easy to deal with because even without channel state information, the cut-set 
bound can be achieved, 

T-)MAC _ t>MAC /1Q\ 
u ach — u cut ■ 

However for the corresponding broadcast channel, in the absence of CSIT, we can only achieve rates that are 
far from the cut-set. In JSJ, it is shown that the presence of channel state feedback, even when delayed, can 
significantly alter the situation, as was the case with broadcast erasure channels: 

Lemma 8. [8] For a fading MISO broadcast channel with a source with L transmit antennas and L single 
transmit antenna receivers, the following DOF region can be achieved with the help of delayed CSIT: 

\. i=i 2-/i=l i ) 

Our first result is to obtain an approximation for the rate region of MIMO broadcast channels with delayed 
feedback, formally stated in the following lemma. 

Lemma 9. For a fading MIMO broadcast channel with I transmit antennas and d users with user i having 
rrii antennas, and delayed CSI feedback, the following DOF region can be achieved: 

v bc 

U ach =i \) ^ ' 

where p := min(7, mi) is the minimum of the number of transmit and receive antennas in the system. 

Thus for a fading MIMO broadcast channel, the rate region is approximately polymatroidal, close to the 
cut, and is approximately reciprocal. 




4.5 Fading X-channels 

Consider a L-user X-channel, there are L sources s\,...,sl and L destinations ti,...,tz with each source 
having message to send to each destination, thus there are I? messages in total. The connectivity graph 
between the nodes is a bi-partite graph with E being the set of edges. We will abbreviate edge (sj, tj) as (i, j) 
since the meaning is clear from the context. 

Note that an interference channel is a special case of this channel. Capacity achieving schemes even for 
the L-user interference channel are not known in the general setting. In [14], the authors show that each user 
in a L-user interference channel can achieve half their point-to-point degrees-of-freedom (DOF) if the channel 
is fast-fading, using a mechanism called interference alignment, which was initially proposed for the 2-user 
X-channel in |59l . This result has since been generalized in several directions; most notably, in ll65l . it is 
shown that using an "ergodic interference alignment" scheme, each user can get half her rate at all SNR, and 
in ll64l . it is shown that the DOF result can be proved even under fixed channel coefficients using a scheme 
termed "real interference alignment". These results have been unified into a single framework in ll93l . It has 
also been shown ll67l that ergodic interference alignment can be used to achieve linear capacity scaling in 
dense interference networks. 
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Figure 2: Cuts in X-Channel 



4.5.1 Cuts in an X-channel 

Consider an example X-channel with two sources and three destinations, shown in Fig. |2] Two cuts are 
marked in the figure. The light (green) cut f2i separates only destination 1 from the two sources, thus providing 
a bound: R\\ + R21 < Cq x , whereas dark (red) cut Q2 separates all sources from all destinations and 
therefore provides a bound on R\\ + R\2 + R13 + R21 + R22 + ^23 < Cq 2 . Of these two bounds, the 
first bound corresponds to that of a polymatroidal constraint whereas the second bound does not correspond 
to a polymatroidal constraint (since in a polymatroidal network, only edges that meet at a node have a joint 
constraint). 

For a general X-channel, these two types of cuts will be present, and we can classify them as 

1. Cuts that separate a single node from the rest of the nodes (referred to as cuts of the polymatroidal 
form), and 

2. Cuts that separate multiple sources from multiple destinations. 

We would like to show an achievable scheme that not only achieves the cut-set bound approximately, 
but also the rate region of the achievable scheme satisfies a polymatroidal constraint. Therefore, for an X- 
channel, we will have to show that only cuts of the polymatroidal form (separating one node from the rest) 
play a dominant role. This is a key challenge that we address in this section. 

4.5.2 Channel Model 

The channel model can therefore be written as, 



where Xi(t), yi(t), Zi(t) are the transmitted vector, received vector and noise vector at time t, In(i) represents 
the set of neighbors of node i who have an incoming edge to i and fading coefficient hij (t) is associated with 
edge G E at time t. The noise vector is assumed to have unit variance and is independent at each node. 
There is a power constraint of P per node. 

We will make the following assumptions about the fading distribution: 

• Fading coefficients are assumed to be i.i.d. over edges and over time. 




(32) 



j'Gln(j) 
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• The fading coefficient will be assumed to be symmetric, i.e., if hij is a discrete random variable, 

Pr{hij = a} = Pr{/ijj = — a}, Va (33) 
otherwise, if the random variable is absolutely continuous, the pdf p(.) must satisfy 

p(hij = a) = p(hij = -a) Va. (34) 

• The fading distribution is assumed to satisfy: 

o:=e -E0og|h| a ) <oo . (35) 



One example of a fading distribution that satisfies these assumptions is when hij (t) is i.i.d. across nodes 
and time with a complex gaussian distribution, for which a = 1.723 11721 . 

We will use the shorthand C(P) to denote the ergodic capacity of a fading channel with power constraint 
P, and the fading coefficient h of unit variance, 



C(P) := E h 



l -\og{l + \h\ 2 P) 



(36) 



4.5.3 Scheme for the K -user interference channel 



First, we consider the case of fc-user interference channel, where there are messages only from Sj to ij for 
each i = 1, 2, k. In the ergodic set-up, the following result is known: 



Lemma 10. 465 1/ For an k-user ergodic interference channel, where the direct links are non-zero, the follow- 
ing rate tuple is achievable: 

Ri = ^C(2P), Vt = l,2,...,fc. (37) 
4.5.4 Scheme for the X-channel 

We generalize this physical layer scheme to the X-channel with L sources and M destinations, demonstrating 
not only that the cut-set bound is approximately achievable, but also that only cut of the polymatroidal form 
are relevant. 

Theorem 2. For an L-source, M-sink ergodic X -channel, the following rate region is achievable, 



Zj:(i,j)eERij<h C ( 2P ) 
£ 4:(M)e ^ < ^(2P) ViGT 



V ach _ I (p>..)\ ^j:(i,j)€E ^ > " c ^ I nR N 



and furthermore, if d is the maximum degree of any node, 



nl h {P) 5 -^d, (39) 



where a := e~^ lo ^ h \ 2 \ 



Proof. Let us write Rtj for the rate of communication between s.j and t j . We use the following achievable 
strategy: 



Let us construct a bipartite graph between the source vertices and sink vertices, and edges given by E. 
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A matching in a bipartite graph is a choice of edges such that each node is present in at most one edge. 
In our case, a matching can be thought of representing a choice of at most one destination for each 
source. Choose a matching M on the bipartite graph, let it be the corresponding permutation. The 
characteristic vector of a bipartite matching is given by the vector (x-ij) : Xij = 1, if (i, j) € M, 
otherwise Xjj = 0. 

Consider the interference channel from si,...,S£ to d^m, d^r^y For the s^t^rn) pairs that are 
connected, we can achieve a rate of ^C(2P) using the strategy of Lemma [lOl 

This implies that a rate given by \C(2P) times the characteristic vector of the bipartite matching is 
achievable. 

Now, we can achieve any convex combination of the rates given by matchings on the graph. This is 
given by the following polytope, called the matching polytope, 

M = conv { (x^ )m I M a matching } . (40) 

By a theorem in bi-partite graph matchings 11771 . this matching polytope can be alternately described 
as: 



V = { Mfo > VU, f*^** | 1 * \ • (41) 



Therefore the achievable rate region is given by (ISTb . 

The cut-set bound 1 implies the following, which are only a subset of the cuts (the cuts which 
separate one node from all the others) : 



(Ri 



Ei^eE Rii < Elog(l + zZr. {id) eE h%P) Vj e T 



Now, due to the concavity of the logarithm, 

Eio g (i+ h %) ^ lQ g( 1+ E m l p ) < 42) 

j-(i,j)eE j-(i,j)&E 

< log(l + dP) (43) 

< Elog(l + adP\h\ 2 ) = C{adP) (44) 

where the last step follows because of the convexity of the function f{x) = log(l + ce x ), i.e., applying 
Jensen inequality for the aforementioned convex function, we get, 

E(log(l + c|/i| 2 )) = E{log(l + ce log|h|2 )} (45) 

> log(l + ce E(log| ' l|2) ) (46) 

= log(l + CO" 1 ), (47) 

where a := e"^ \ h ^). 

Thus the cut-set bound implies the following inequalities, 
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• Therefore we get the result that 

T%*{P) 2 -2^. (48) 

□ 

Thus for a fading X channel, the rate region is exactly polymatroidal, approximately close to the cut, and 
is exactly reciprocal (since the description of the rate region remains the same even the channel is reversed). 

4.6 Fixed X -channels 

Consider a L-user X-channel with fixed channel coefficients drawn from a continuous distribution. We will 
obtain a degrees-of-freedom characterization of this X-channel (which holds almost surely). The channel 
model can therefore be written as, 

yi {t)= hijxjit) + Zi(t) W = 1,2,..,T, (49) 

jeln(i) 

where Xi(t),yi(t), Zi(t) are the transmitted vector, received vector and noise vector at time t, In(i) represents 
the set of neighbors of node i who have an incoming edge to i and channel coefficient hij associated with 
edge £ E is drawn from a continuous distribution which has a probability density function, i.e., the 

probability measure is absolutely continuous with respect to the Borel measure. The noise vector is assumed 
to have unit variance and is independent at each node. There is a power constraint of P per node. 

First, we consider the case of L-user interference channel, where there are messages only from Sj to t% for 
each i. The following result characterizes the degrees of freedom of the L-user interference channel: 



Lemma 11. H64V For a k-user interference channel with channel coefficients drawn from a continuous distri- 
bution, if the direct links are non-zero, the following DOF tuple is achievable almost surely: 

Di = \, Vi = l,2,...,fc. (50) 

We can generalize this interference channel scheme to the X-channel with L sources and M destinations, 
using the same method as in Theorem [2] 

Theorem 3. For an L-source, M -sink fixed X -channel, the following DOF region is achievable almost surely, 

^=((A,)|^^|? (51) 

and furthermore, 

TfLh^-f a.s. (52) 

Proof. The proof proceeds in a manner quite similar to that of Theorem |2l the only difference is that we use 
Lemma [TT1 instead of Lemma [TOl Also, since we are dealing with DOF, which is an SNR-scaling characteri- 
zation, the (constant) power scaling factor is not relevant. □ 

Thus for a fixed X channel, the achievable DOF region is exactly polymatroidal, approximately close to 
the cut, and is exactly reciprocal (since the description of the achievable DOF region remains the same even 
the channel is reversed). 
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4.7 Fading Linear Deterministic Channels 

In Sec. 14.11 we considered linear deterministic networks which had only broadcast and MAC components. In 
this section we will consider a general linear deterministic network with fading. The communication network 
is represented by a directed graph G = (V, E). If (i, j) 6 E (j, i) 6 E, we call the network bidirected. 

The edges that are present have fading matrix Hij(t) on them. Each fading coefficient in each matrix is 
distributed i.i.d. fading over edges and time. The fading distribution for each non-zero coefficient is assumed 
to be uniform over the finite field (except zero) ¥ g 0. The proof can be extended to the case where the fading 
takes value but there will be penalty factor of 

For a linear deterministic interference network with fading, there is a scheme based on ergodic interference 
alignment that achieves half the point-to-point rate for each user: 

Lemma 12. H65\l For an k-user fading linear deterministic interference channel, with direct links being non- 
zero, the following rate type is achievable: 

Ri = ^log 2 (<7), Vi = l,2,...,fc. (53) 

We can use this scheme to create a scheme for the X-channel, the rate region of this scheme is quantified 
in the following theorem. 

Theorem 4. For an L-source, M -destination deterministic ergodic X -channel, the following rate region is 
achievable, 

Kch 2 ^ (54) 
furthermore, only cuts that separate one node from the rest are sufficient. 

Proof. The proof for this case is very similar to the proof of Theorem 12 except that the term ^C(2P) is 
now replaced by ilog 2 ((?). Also, there are no power scaling losses in the case of a linear deterministic 
network. □ 

Thus for a linear deterministic X channel with fading, good local schemes exist; furthermore the schemes 
have a rate region which can be described by polymatroidal constraints. 

5 Approximate Capacity Results for Wireless Networks 

In this section, we will present approximate capacity results for wireless networks with multiple unicast traffic 
for several channel models. Our achievable schemes will use a layering architecture and we will use the cut- 
set bound as the outer bound. For each channel model, we will have the description of the wireless network 
as a graph G = (V, E). There are k designated source nodes si, s^, which have independent messages 
and k corresponding destination nodes t\,...,tk'. U wants to decode the message of Sj with vanishingly small 
probability of error, for every i = 1, . . . k. Let TZ c J^ h denote the rate region comprising of the rate tuples 
achievable and let TZ^ t denote the rate region corresponding to the cut-set bound for the channel model ch. If 
a power constraint P is present, we will describe these regions as a function of P, i.e., 7l c ^. h (P) and IZ^^P). 

5.1 Gaussian networks with MAC and Broadcast components 

We consider Gaussian networks in which there are only broadcast and MAC components, i.e., there is no 
interference channel component. This is equivalent to the assertion that each edge is involved in either a 
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superposition constraint or in a broadcast constraint but not in both. In practice, such a network can be realized 
by using a partial frequency reuse scheme, where the total bandwidth is divided into different chunks, which 
are assigned to users in such a way that interference component is avoided. 

Gaussian networks with MAC and broadcast components alone have been previously considered in R6l . 
where it has been shown that for a network composed of 2-user MAC and broadcast channels, a separation 
architecture involving local coding is approximately optimal. Since this general structural result holds for all 
possible traffic models, it may be necessary in general to use network coding (6] B4l at the network layer. 
In contrast, in this paper, we will assume multiple unicast traffic and utilize the reciprocity of the wireless 
network to show that the cut-set bound can be approximately achieved using a separation scheme along with 
routing. 



Inflow at 1 : 

r„<c, 




1(23) 



(a) Example of Frequency planned 
network 



(b) Equivalent noiseless network 



Figure 3: A Network composed of broadcast and MAC components and its Polymatroidal Equivalent 



Network Model: The communication network is represented by an undirected graph G = (V,E), and an 
edge coloring ip : E — > C, where C is the set of colors. Each node v has a set of colors C(v) C C on which 
it transmits and receives. Each color can be thought of as an orthogonal resource (for example, a frequency 
band), and therefore the broadcast and superposition constraints for the wireless channel apply only within a 
given color. We will assume the nodes are equipped with full duplex radios on each of these resources. For 
simplicity of notation, we will assume that there is a one-to-one correspondence between colors and channels, 
i.e., each channel operates on a distinct color; so c stands for a unique color and a unique channel. 
The channel model can therefore be written as, 

Vi= E h ij x j+^ VceC(i), (55) 

jeA/" c (i) 

where xf,yf,zf are the transmitted vector, received vector and noise vector on color c, hfj = hj { is the 
channel coefficient between node i and node j on color c and N c {i) represents the set of neighbors of node 
i who are operating on color c and d c (i) = \Af c (i) \ be the degree of node % in color c. Let d = max C] „ d c (v) 
be the maximum degree of any node in a given color; therefore, d is the maximum number of users on any 
component broadcast or multiple access channel. Each node has a power constraint P per edge. Therefore 
node v has power constraint Pd c {v) for transmitting on color c. By the very definition, this network has a 
reciprocal MAC channel for every broadcast channel and vice versa. Let V(c) = {v : c € C(v)} be the set 
of nodes that use the color c. An example of a wireless network along with its equivalent noiseless network 
are shown in Fig. [3] 
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Theorem 5. For the k-unicast problem in Gaussian network composed of broadcast and multiple access 
channels, a simple separation strategy can achieve a rate, 



This means that the min cut, scaled down in power by a factor d and in rate by a factor C(log A;), can be 
achieved. For the unicast scenario (k = 1), we can show using a similar proof that 

fcL.tf') 2 (57) 



This result is similar to that obtained by 11121 . except that here it is obtained for the special case of networks 
composed of broadcast and multiple access channels. The scheme in [12] requires a global physical layer 
scheme (the "quantize and map" strategy), while for the special case of networks here we show that a simple 
separation strategy suffices. 

5.1.1 Coding Scheme: Proof of Theorem |5] 

The coding scheme is a separation-based (layered) strategy: each component broadcast or multiple access 
channel is coded for independently creating bit-pipes on which information is routed globally. The achievable 
scheme used for the MAC and broadcast channel are discussed in detail in Sec 14.21 The achievable rate region 
for the MAC and Broadcast channels described there are polymatroidal and therefore each multiple access 
or broadcast channel with d users can be replaced by a set of d bit-pipes whose rates are jointly constrained 
by the corresponding polymatroidal constraints. Thus we get a polymatroidal network by using this layered 
strategy; this polymatroidal network is described as follows: for each node v in the original graph, there 
are several vertices v c , one for each color c 6 C(v). There is an edge between u c and v c if h c uv ^ 0, the 
polymatroidal constraints are given by 

p-(F w ) = log(l+ \ h ™\ 2 P) VF V C<T«) (58) 

p+(F,) = log(l+ \ h vu\ 2p ) VFvC6+(v c ), (59) 

u:(u,v)£F v 

and the polymatroidal network is bidirected due to the fact that h c uv = h c vu and the reciprocity in the rate 
regions of the MAC and BC channel. Further, there are edges between v c and v c > of infinite capacity, since 
these correspond to the same node v in the original graph. 

Let us call the cut-set bound region on this induced polymatroidal network as 7^° t ly (P), and let us call the 
rate region for the flow-based achievable scheme as 1Z^(P). Then we have from Theorem Q] that 



KZ(P) ^ ^TZ^- (60) 



— n cut V 

Ologfc 

As an example, the bidirected polymatroidal network induced for the example of Fig. [3a] is shown in 
Fig. EH The submodular constraints are explicitly written down only for node 1, but similar constraints apply 
at nodes 2, 3 and 6. In this figure, C21, C31 and 0/23)1 represent constraints on the rate of communication from 
node 2 to 1, the rate of communication from node 3 to 1 and the sum rate from nodes 2 and 3 to 1 respectively. 

It is now sufficient to compare the cuts on the polymatroidal and the Gaussian network. 

Lemma 13. 

K 8 cut {P)QK p *?{dP). (61) 
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Proof. Given a cut Fq in the polymatroidal network, we will show that there is a corresponding cut in the 
Gaussian network, whose value is within a power scaling factor d of the polymatroidal cut. The value of the 
cut in the polymatroidal network is u(Fq) = ^2 c v(Fq), i.e., the polymatroidal cut breaks up into the sum 
of the cuts of various colors. The value of cut in a given color v{F^) corresponds to a certain polymatroidal 
constraint in a given MAC or broadcast channel. We need to show that there is a similar cut in the Gaussian 
network whose value is within a power scaling factor. 

As shown in Lemma |4j we have that for each of these channels, the Gaussian cut and the polymatroid 
representing the achievable scheme are within a power scaling factor of d, and therefore 

K*(P) C K^(dP), (62) 
since both polymatroidal and Gaussian cuts decompose into sum of individual cuts. □ 
The achievable rate using the separation strategy is given using (l60l) as 

-r>p°iy/p\ -r>s (P\ 

TpP ol y(p\ ~-> /v cut K£_) /v cutV d > / fi a\ 

achl '- O(logfc) -0(logJfe)" ( j 

This completes the proof of Theorem [5] 



5.1.2 Special Traffic Scenarios 

We now present results for directed networks with MAC and broadcast components under the special traffic 
patterns presented in Sec. 13.41 

Theorem 6. For a directed Gaussian network composed of broadcast and multiple access channels, a simple 
separation strategy can achieve a rate, 

UUP) 2 MBCTr^, (64, 

R'S(P) 3 forXTraffic, (65) 

RTc"h( P ) 2 c "' 2 for group-communication traffic. (66) 

Proof. The proofs are based on the polymatroidal results Lemma [T] Lemma |2] and Lemma [3j The proof is 
very similar to the proof of Theorem [5] and are therefore omitted. □ 



5.2 Broadcast Erasure Networks with Commensurate Feedback 

Broadcast erasure networks, in which there are broadcast but no superposition constraints, serve as high level 
models for communication in wireless networks. Unicast in broadcast erasure networks is well understood, 
for which it has been shown lfl9l that min-cut is achievable using a global linear network coding scheme 
(in ll53l . it is shown that knowledge of erasure locations is not necessary at the destination) . It has also 
been shown that a separation scheme in which each broadcast erasure channel is coded for locally to create 
noiseless links does not perform very well. This is due to the fact that for each broadcast erasure channel 
the capacity region is far away from the min-cut region. However, as shown in Sec. 14.31 by utilizing ACK 
feedback, the capacity region is enlarged to become closer to the min-cut region. Therefore we consider a 
network of broadcast erasure channel, where each channel has a feedback mechanism. 

Consider a network which is composed of broadcast erasure channels, with an appropriate mechanism 
for feedback built into the network. In particular, we look for feedback that is commensurate; formally: a 



23 



channel is said to have commensurate feedback if there are feedback links from the various receiving nodes 
to the transmitters with the same rate region as the cut-set bound for the forward channel. The reciprocal 
nature of wireless channels from which the broadcast erasure channel is constructed naturally suggests a 
way of providing feedback links of commensurate strength. In Appendix [0 we look at one possible way of 
obtaining feedback links of commensurate strength as the forward links. 

From now on, we will assume that each erasure broadcast channel has commensurate feedback, without 
cognizance to how this particular rate region was obtained. For simplicity, we will assume that all broadcast 
erasure channels are symmetric and independent, i.e., each broadcast erasure channel has erasure independent 
probability e. 

Theorem 7. For the k unicast problem in a network of erasure broadcast channels with commensurate feed- 
back, a simple separation strategy can achieve a rate 

nj erasure 

<r> erasure — , ' v c»f (f\l\ 

/Coc/l - 0(logA;)O(logd max )' K ° ' 

where d max is the maximum number of users in any broadcast erasure channel. 

Proof. Consider the following separation strategy: even though the feedback links have a rate region given 
by R^, we will restrict them to use the rate region #^fb m order to preserve symmetry. The feedback links 
are used for two distinct puiposes: 

• To provide Ack / Nack feedback. This feedback has an overhead of 1-bit per packet which we treat as 
negligible. This assumption makes sense especially when packet lengths are large. 

• To route flows on the reverse direction, since the Ack feedback overhead is assumed to be small, this 
will essentially occupy the whole capacity. We establish a bidirected network by using the feedback 
links for routing. 

Since we have the Ack / Nack feedback for each erasure broadcast channel, we can use the scheme of Lemma[6] 
to obtain the rate region R^j^ with feedback. This induces a bidirected polymatroidal network specified in 

the following manner, in which we can use flows to achieve a rate region R^± ■ By TheoremQ] we have that 

-r>p°iy 

By Lemma|7| we have TV^htb 5 o{\og"d — )■ F urm er, since cuts in the polymatroidal and the original network 
decompose into cuts for each channel, any cut-set in the polymatroidal network induced by the achievable 
scheme has a counter-part cut-set in the erasure network within a factor of C(log d max ): 

<i-> erasure 

= n( C " y (69) 



"T-> erasure 

ach - 0(logfc)0(logd max )' k ; 



Now ([681) and ([69j> together imply, 



which proves the desired result. □ 



24 



5.2.1 Special Traffic Scenarios 



We now present results for directed networks with broadcast erasure channels under the special traffic patterns 
presented in Sec. 13.41 Since the networks are directed, reciprocity is not needed; however we will continue to 
assume that the broadcast erasure channel has ACK feedback. 

Theorem 8. For a directed network composed of broadcast erasure channels with ACK feedback, a simple 
separation strategy can achieve a rate, 

n ach D - .^ Cl " for BC Traffic, (71) 

log(d max + 1) 

nsum 

fi = 3 tog5w+T) *™ <72) 

R S ZT 5 cut r for group-communication traffic, (73) 

21og(d max + 1) 

where <i max is the maximum degree of the broadcast channel. 

Proof. The proofs are similar to the proof of Theorem |7] and are therefore omitted. □ 



5.3 Gaussian Fast Fading Network 

We will now consider a general Gaussian network where broadcast and superposition can simultaneously 
occur, i.e., the network can contain interference components. Such a network is clearly more general than 
the .ff -user interference channel, and even this channel has not been well understood in the most general case 
( the tutorial IT371 provides an excellent summary of the current understanding on this channel). However, 
in the presence of fast fading, the problem gets symmetrized considerably ll65l . and there is a reasonable 
understanding of this problem. Therefore we will resort to the fast fading model in this section. We will 
further assume that the fading distribution satisfies the assumptions in Sec. 14.51 

While most of the existing literature is on single hop interference channels, multi-hop interference net- 
works have been the focus of more recent work. In particular, it has been shown in ll38l that the degrees 
of freedom of such fully-connected layered networks can be achieved using a non-separation scheme called 
opportunistic interference alignment. It has also been shown ll32l that a separation architecture does not even 
achieve the DOF for simple networks, for example, the network with 2 sources, 2 relays and 2 destinations. 
Our results offer a contrast: if we look to achieve within an approximation factor of cut-set then a simple 
separation strategy suffices, for all SNR. 

For examples of networks considered here, we refer the reader to Fig. @] for a non-layered example or the 
one in Fig.|5]for a layered example. 



»1 




Figure 4: A Multiple-Unicast Wireless Network 
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(a) Layered Network (b) Cuts 

Figure 5: Layered Network and Cuts 



Theorem 9. For a bidirected ergodic wireless network with k source destination pairs, the rate region given 
by 

K 9 (A) 

n9{p) ^ W (74) 
C(log k) 

is achievable using a separation strategy, where d is the maximum degree of any node and b := - — ( ^ ' } JS 
a constant depending on the fading distribution. If the fading is complex Gaussian, b ~ 0.86. 

Proof. We will now show a scheme by which we can convert the bidirected ergodic wireless network into a 
bidirected polymatroidal network. We will view one snapshot of transmission in the network as a transmission 
in a bipartite graph, from the set of nodes V = V on the left side to the set of nodes V r on the right side. 
Node u on the left is connected to node v on the right with channel coefficient h uv from the original network. 
The nodes have infinite memory, so each node is connected to itself with infinite capacity. We view the 
obtained network as a single hop network and use a scheme for the single hop (described in Sec. [43). The 
achievable rates for this single hop network are given by polymatroidal constraints, and therefore we obtain 
a polymatroidal network represented by a bi-partite graph with 2V nodes. This process can now be reverted, 
i.e., we can go from a layered representation with 2V nodes to a non-layered representation with V nodes. 
Thus we obtain a polymatroidal network with V nodes. There are edges E similar to the original graph, 
however the capacities are constrained according to the polymatroidal constraint: 



R uv < \C{2P) Mv e 



V (75) 



Rvu<\c(2P) Vuey. (76) 

ugOuto 

This is now a bidirected polymatroidal network since the polymatroidal constraint is symmetric for the in- 
coming and the outgoing edges at any given node. Now we perform routing over this bidirected polymatroidal 
network. Now the flow and cut on this network are related by: 

<r>p°iy 

****** ~ O(logfc)- (//) 

If we can relate the cut-set bound on the polymatroidal network and the cut-set bound on the original 
Gaussian network, we can get the desired result. This is done in the following lemma: 
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Lemma 14. 

2 "" \bd 3 

Proof. See Appendix |F] □ 
Now, ((771) and <|78l» implies that 

^original / p \ 

^ original _ /v cut \bd? J nQ ^ 
'^ach — /r\n 7 \ ) \'?) 

©(log k) 

which completes the proof of the theorem. □ 



5.3.1 Multi- Antenna Nodes 

We can prove a result similar to the one in Theorem [9] even when there are multiple antennas at the nodes. 

Theorem 10. For a bidirected ergodic wireless network with multiple antenna nodes, the rate region for 
k-unicast is given by 

*(P) 2 ^r4r> (80) 
C(log k) 

is achievable using a separation strategy, where d is the maximum number of antennas that can communicate 
with any given antenna and b := - — ( 2 Bl 1 1 is a constant depending on the fading distribution. 

Proof. The proof of the theorem is very similar to that of Theorem [9] except that when there are multiple an- 
tennas, each antenna is treated as a separate node with infinite capacity wireline links between antennas of the 
same node. This scheme can be shown to achieve the cut-set of this new network to within the approximation 
factor. We observe that any finite cut in this new network will partition the antennas in such a way that all 
antennas corresponding to the same original node will lie on the same side of the partition, since otherwise 
the value of the cut will become infinite. Therefore the cut-set in the new network and the original network 
have the same value and this completes the proof of the theorem. □ 

5.3.2 Special Traffic Scenarios 

We now present results for directed fast fading networks under the special traffic patterns presented in Sec. 13.41 
Since the network is directed, reciprocity will not be necessary to prove this result. 

Theorem 11. For a directed fast fading Gaussian network with multiple antenna nodes, a simple separation 
strategy can achieve a rate, 

Tl ach (P) D cut ^ f for BC Traffic, (81) 
jjsum f P \ 

Kh(P) ^ CUt 2 for X Traffic, (82) 

j^sum f \ 

Pach(P) — — - — — for group-communication traffic, (83) 

where d is the maximum number of antennas that can communicate with any given antenna and b := 
- — ( 2 1 ) I s a constant depending on the fading distribution. 

Proof. The proofs are similar to the proof of Theorem [9] and are therefore omitted. Here, the factor loss of 2 
is present for the BC scenario due the factor 2 loss in the local physical layer scheme (see Theorem[2]). □ 
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(a) Wireless Network (b) Polymatroidal Network 

Figure 6: Directed Layered F.C. Networks 



5.3.3 Special Channel Model: Directed Layered Network 

In this section, we consider directed fully-connected (f.c.) layered networks. These are layered networks, 
which have connectivity between adjacent layers only in the forward direction, i.e., links are always between 
a node in Vi to a node in Vi+i. Further for a fully-connected network, we assume that (u, v) E £ Vu € 
Vi, v S Vi+i. Consider, for example, the network in Fig. [6a] 

Theorem 12. For a directed fully-connected layered ergodic wireless network with k-distinct sources in the 
first layer having messages to k distinct sinks in the last layer, the rate region given by 

K{P) D Ucut ^\ (84) 

is achievable using a separation strategy, where d is the maximum degree of any node and b := e E(1 ° g|h| \ 



Proof. The achievable scheme is again a local Phy scheme, i.e., for each hop in the layered network we use the 
strategy for X-channels described in Sec. 14.51 Now we get a special type of directed polymatroidal network 
for which we can show that max-flow equals min-cut. 

Theorem 13. For a k-unicast problem in a polymatroidal directed layered network in which all the node 
constraints are of the form 

Ruv<l, W, (85) 

M6/«{t)} 

Ruv<l, Vn, (86) 

v£Out{u} 

the rate region given by max-flow equals the min-cut for the k-unicast problem. 

Proof. See Appendix [G] □ 
Now we can use Lemma [T4l to get: 



KcfV) = H$(P) = K^(P) 2 \is\, (87) 



1 original J P 

2 cut \bd 3 

which implies the theorem. □ 
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5.4 Fixed Gaussian Network 



Consider a general Gaussian network where broadcast and superposition can occur simultaneously, and where 
the channel coefficients are fixed but drawn from a continuous distribution. We studied a simple single-hop 
version of such a network in Sec. 14.61 We will now show that the local scheme can be placed in a network 
context to get an almost-sure DOF characterization. 

Theorem 14. For a bidirected fixed multi-antenna Gaussian network with k source destination pairs, the 
DOF given by 

V ach D n V , CU \ v (88) 
0(log k) 

is achievable almost surely using a separation strategy. 

Proof. The proof proceeds analogously to Theorem [lOl but instead of using the local scheme in Theorem [2 
the scheme of Theorem [3] is used and therefore a DOF characterization is obtained. □ 

5.4.1 Special Traffic Scenarios 

We now present results for directed Gaussian networks under the special traffic patterns presented in Sec. |3.4| 
Since the network is directed, reciprocity will not be necessary to prove this result. 

Theorem 15. For a directed Gaussian network with each channel coefficient chosen from a continuous dis- 
tribution, a simple separation strategy can achieve a DOF region, 

(89) 
(90) 
(91) 



D ach 


D 


2 


for BC Traffic, 


r\sum 
u ach 


D 


r)sum 

2 


for X Traffic, 


u ach 


D 


j~)Sum 

4 


for group-communication traffic 



5.5 Linear deterministic networks with MAC and Broadcast components 

A linear deterministic network composed of MAC and broadcast components is defined in the same way as the 
Gaussian network composed of MAC and broadcast components. The key difference is that the transmissions 
are over a finite field ¥ q , there is no noise, and the channels between node i and j of color c are matrices 
in general: Hfj. The main result for linear deterministic networks with MAC and broadcast components is 
stated in the following theorem: 

Theorem 16. For the k-unicast problem in linear deterministic network composed of broadcast and multiple 
access channels, the layered architecture can achieve a rate, 



TZ Id , D £01 (Q7) 

ach ~ 0(logfc)' 



Proof. The proof of this theorem proceeds in a similar way as that of Theorem [5J the key difference being the 
fact that for linear deterministic networks, the cut-set bounds under product form and general distributions are 
the same, and therefore, there is no power scaling factor. □ 
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T^ach 


D 


7Zcut 


/or BC Traffic, 


rjsum 
^ach 


D 


jysum 


for X Traffic, 


rjsum 
^ach 


D 


Tjsum 
2 


for group-communication traffic 



5.5.1 Special Traffic Scenarios 

We now present results for directed linear deterministic networks with MAC and broadcast components under 
the special traffic patterns presented in Sec. |3.4i 

Theorem 17. For a directed linear deterministic network composed of broadcast and multiple access chan- 
nels, a simple separation strategy can achieve a rate, 

(93) 
(94) 

(95) 

Proof. The proofs are similar to the proof of Theorem [16] and are therefore omitted. □ 

5.6 Networks of Fast Fading MAC and Broadcast Channels with delayed CSIT 

We will now networks composed of fast fading MAC and Broadcast channels where the channel states are 
i.i.d. over antennas and time, and the CSI is available at the transmitting nodes after a delay. Schemes for 
each of these local channels was studied in Sec. 14.41 and a degree of freedom characterization was obtained. 
Now, we will try to obtain the degree of freedom region for multiple unicast over a network of such channels. 
Our main result for such networks is the following. 

Theorem 18. For the k unicast problem in Gaussian network composed of fading broadcast and multiple 
access channels with delayed feedback, a simple separation strategy can achieve a DOF, 

D 

Vach - o{\ogk)o[\ogp^y (96) 

where 

Pmax = m&x p(BC channel), (97) 

BC channels 

and p(BC channel) is given by the minimum of number of transmit antennas and the total number of received 
antennas in the broadcast channel. 

Proof. The coding scheme is again a separation-based strategy: each component broadcast or multiple access 
channel with delayed feedback is coded for independently creating bit-pipes on which information is routed 
globally. The physical layer technique is the scheme for MIMO broadcast and MAC channels with delayed 
CSIT proposed in Sec. 14.41 The proof is very similar to that in Sec. 15.21 

For the fading MIMO broadcast channel with delayed feedback, we can achieve the following DOF region 



MAC 
ach 



^ach w i tn feedback. For the fading MIMO multiple access channel, we can achieve the region given by V. 
however we will restrict ourselves to achieve the smaller rate region for the purpose of symmetry. This 
induces a bidirected polymatroidal network, in which we can use flows to achieve a rate region R p a °^ ■ By 
Theorem [H we have 

ppoly /v cut mox 

R ^ ~ OQ^k)- (98) 

"J^BC MAP £>MAC 

By Lemma m we have T>,x, ^ t^tt — m — r and by choice, = 7nn cul r • Further, since cuts in 

J ' ach — C>(logp max ) J ' ach C(logp ma x) ' 

the polymatroidal and the original fading network decompose into cuts for each channel, any cut-set in the 



30 



polymatroidal network induced by the achievable scheme has a counter-part cut-set in the erasure network 
within a factor of C(logp max ): 

..fading 



7C' y = 7^ (99) 

0(logp max ) 

Also, we can achieve the same DOF in the original fading network as the rate in the polymatroidal network, 
i.e., 

THg = T%r. (100) 



v: 



This implies that 



■^fading ^cut 

ach " 0(log 3 k)0{ 

This completes the proof of the theorem. □ 
5.6.1 Special Traffic Scenarios 

We now present results for directed networks with MIMO broadcast and MAC channels with delayed feedback 
under the special traffic patterns presented in Sec. 13.41 Since the networks are directed, reciprocity is not 
needed; however we will continue to assume that the broadcast channel gets delayed CSI feedback. 

Theorem 19. For a directed network composed of MIMO broadcast and MAC channels with delayed CSI 
feedback, a simple separation strategy can achieve a DOF region, 

V ach D Vcut for BC Traffic, (102) 

log(Pmax + 1) 
r\sum 

D = 2 M^TT) **™ < 103 > 

j-j s urn 

D s "% D — — for group-communication traffic, (104) 

21og(p max + 1) 

where 

Pmax = max p(BC channel), (105) 

BC channels 

andp{BC channel) is given by the minimum of number of transmit antennas and the total number of received 
antennas in the broadcast channel. 

Proof. The proofs are similar to the proof of Theorem [18] and are therefore omitted. □ 
5.7 Fading Linear Deterministic Network 

Consider a linear deterministic network of the form defined in Sec. 14.71 where each of the non-zero channel 
coefficients Hij(t) undergo i.i.d. fading with a uniform distribution on the non-zero elements. Then the local 
scheme of Sec. |4.7| can be extended to a global network scheme. 

Theorem 20. For a bidirected linear deterministic network with k source destination pairs, the rate region 
given by 

n ach D ,.. 7 f"'' , (106) 
C(log k) 

is achievable using a separation strategy. 

Proof. The proof proceeds analogously to Theorem |9j but instead of using the local scheme in Theorem |2j 
the scheme of Theorem |4] □ 



fading 



aCh " 0(l0g 3 fc)0(l0gp m ax)' 
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5.7.1 Special Traffic Scenarios 

We now present results for directed fast fading linear deterministic networks under the special traffic patterns 
presented in Sec. 13.41 Since the network is directed, reciprocity will not be necessary to prove this result. 

Theorem 21. For a directed fast fading linear deterministic network, a simple separation strategy can achieve 
a rate, 

n ach D ^ for BC Traffic, (107) 
-psum 

RS 2 -f- far X Traffic, (108) 

j^sum 

Rach — — for group -communication traffic. (109) 
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A Proof of Lemma ID 

The rate region for cut-set under product distribution is given by: 



7-, MAC 
'^cut.product 



(P) = jp:^^<log(l + ^|^| 2 P)l. (110) 



The rate region for cut-set under general distribution is given by: 



7-, MAC 
'^cut.general 



(P) = j P : ]T P, < log(l + (£, I^D 2p ) \ ■ CUD 

I i€S i£S J 



By the Cauchy-Schwarz inequality, we get 

2 



which in turn implies 



We can similarly show that, 



(^n) p < (^T\hiAdP, (ii2) 

Vies / Vies / 



^-cut,general(-f) — ^cut,product i^P ) ■ (H3) 



^-cut,general(-f) ~~ ^cut.product (dP) ■ (114) 

Along with the equality in (fl71) . this implies that 

^general( J P)^« C (^)- (H5) 

We similarly get, using (|2TI ) and (11141 ). 

^Sg^nd^C ^(dP). (116) 

B Proof of Lemma |5] 

Without feedback the capacity of this erasure broadcast channel can be easily found. This is because this 
erasure broadcast channel is stochastically degraded ifTTl . and the capacity is given by 

(Pi,...,P d )| Yl Ai<l-e[. (H7) 

i=l,2,..,d J 

The rate region can be achieved by time sharing between the individual links. We can compare this rate to the 
cut-set bound which is given by 

Pcut=|(Pi,...,PD)|^P l <l-e |J| VJC {l,2,..,d}i . (118) 

The ratio between the sum rate of the scheme and the cut-set bound is the factor 

which — > -, as e — >• 1. (119) 



1 - e d ' d 
As expected, the time sharing region does not compare very favorably to the cut-set bound 
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C Proof of Lemma H 



Consider the rate region with feedback 7£ a ch,fb- We would like to know for what value of A does ATZ ac ^fb 5 
TZ cut . Let us take a point in 1Z cxA , we would like to know, for what value of A does this imply Yl%=i 2 d T^t 7 — 
A. This is equivalent to 

n R 

i = max ElI7' (120) 

i=l 

such that, 

^i?i<l-e |J| VJC {1,2,. .,d}. (121) 
This is a linear optimization over a polymatroid and the optimal solution is given by the greedy algorithm 



R d ) = (1 - e, e - e 2 , e™" 1 - e"), (122) 
and the optimal value of the objective function is 



e i-i _ e i 



Lets examine the 2-th term in this sum, substituting x = e 1 , 



— 1 l + + + 1 i 
Therefore the sum is upper bounded by 



X ~ l <L (124) 



A<^-<logd. (125) 



1 

1 



D Feedback: Multiple Access Erasure Channel 

Consider a finite field multiple access erasure channel, where 

d 

y = '^2e i x i , (126) 

i=l 

where ei are i.i.d. Bernouli with probability 1 — e. In this channel, some of the transmitters' packets can get 
erased, and the received vector is the sum of those packets that did not get erased. 

This multiple access channel is the dual of the broadcast erasure channel in the sense that the cut-set 
bound of the two channels are identical. This channel can be realized physically by having using computation 
code on the wireless channel, which computes the required linear combination. If all the channel coefficients 
in the wireless channel are good, then the combination Ylt=i %i can be computed easily. However if one 
of the channel coefficient, say hj , is in deep fade, then it may be algorithmically hard to compute this linear 
combination. Therefore, one way around this problem is to avoid having X, in the linear combination and 
instead compute £) i=1 2 d tyj -^i- This gives rise to the channel model in d 126b - 
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The capacity of this multiple access channel is given by the cut-set bound (which is the same as the 
broadcast channel cut-set bound), 

TZ MAC = TZ^f = {(R 1 ,...,R D )\^2R l <l-e\ J \ VJ}. (127) 

ieJ 

Here J C {1, 2, .., d}. We emphasize that the capacity region of this multiple access channel is equal to the 
cut-set bound of the erasure broadcast channel. 

E Proof of Lemma @ 

We will first consider the case of a network with I transmit antennas and m single-antenna receivers. We can 
assume I < m, since if I > m, we can restrict ourselves to using m transmit antennas, which leaves the cut 
unaltered. Therefore p = I. 

We can choose any particular subset of I receivers and use the strategy in Lemma [8] to achieve a DOF 
of Q(i og ij f° r eacri receiver. We can time share between all possible subsets of size I to achieve a certain 
DOF region. To compute the rate region achievable by this method, we use the following trick: Let the DOF 
tuple achieved be Qn^rr) ( r i> ■••> r m)- Let us construct a bi -partite graph with I nodes on the left partition 
and m nodes on the right partition and a complete graph connecting them. Each matching is equivalent to 
choosing a certain subset of the receivers (given by the set of right-partition nodes covered by the matching) 
and achieving DOF 1 for the each of the receivers. The characteristic vector of a bipartite matching is given 
by (x^m such that = 1 for edge in the matching M and Xij = otherwise. The convex hull of 
these characteristic vectors is given by 

M. = conv {(xij)M\M a matching } . (128) 

For a bipartite graph, this is equivalent to the following polytope 11771 

V = jcXijOIxy > Vi,J, Y, X V ^ 1 yi J2 X V ^ 1 V -? j ■ (129) 

The DOF dj is given by 

d ' = o^i)X>«- (130) 

Now consider the following polytope, 

'-=ji« 20 v ** s oob)'i^opbjJ- (131) 

We can show that D ac h is equivalent to V by using the mapping 

V>:£>ach V (132) 

il>{(dj)} = (x ij ):x ij = 0(logl)^-, (133) 

and the mapping 

C-V -> Pach (134) 

C{(xij)} = (dj) : dj = —-— J2 x ij- ( 135 ) 
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Thus the region V ac ^ is achievable. Also the cut-set bound is given by 



This implies that 



(dj)\dj > Vj, dj < 1, ^dj <l\ . (136) 



V ach = J?™' , (137) 
O(logZ) 

which completes the proof of the single antenna receiver case. 

For the multi-antenna receiver case, we will treat it as being composed of many single antenna receivers 
each of which are receiving independent information and then sum up the rates. This proof will extend to this 
case to get the desired result. 

F Proof of Lemma HH 

Let us start with a cut on the polymatroidal network. A cut on the polymatroidal network specifies a vertex 
partition and along with it, specifies how to compute the value of the cut, by specifying which edges to 
group together for the sub-modular constraints. For example, consider the cut in Fig. [5b] it features the vertex 
partition and also specifies how to group edges to get an upper bound. 

From the given partition we need to construct a cut on the Gaussian network. The polymatroidal cuts 
specify which of the edges need to be grouped together. By bounding the polymatroidal network in this 
manner, each edge is involved in either a broadcast constraint or a superposition constraint. This is in contrast 
to the Gaussian case, where each cut has a certain value, and there is no sense in which edges are assigned to 
broadcast or supeiposition constraint. 

The key idea to connect these two cuts is the idea of decoupling the constraints in the Gaussian channel: 

• In the Gaussian channel, if broadcast constraint is not active for a given edge, then the edge only 
participates in the superposition constraint and vice versa. 

• While this is not true as it is in the Gaussian network, we can obtain an upper bound network where 
certain broadcast or superposition constraints are no longer active. This process of obtaining upper 
bound network where a ceratin constraint is not active is called decoupling. 

• Decoupling a broadcast constraint is easy, because a network in which edges are not involved in a 
broadcast constraint can only, in general, do better than a network where there is a broadcast constraint 
on the edges. 

• Decoupling the superposition constraint requires a bit more work, this can be illustrated using the 
following example. Suppose two edges e\ and e2 are involved in the superposition constraint in the 
following manner: 

y = xi + x 2 + z. (138) 

Then if we have construct another channel in which the edges e\ and e2 are not involved in a broadcast 
constraint, then we have, 

yi = Xi + Zi,i = 1,2, (139) 
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this channel can emulate any scheme in the original channel if Var(zj) = Yat ( z ) . if this condition is 
satisfied, then we can add up y\ + yi to get x\ + X2 + (^i + Z2), which is statistically equivalent to 
the original channel. Therefore to decouple the superposition constraint involving d variables, we need 
to reduce the variance of the noise by d, the degree of the superposition constraint, or equivalently, 
increase the signal power by a factor d. 

• Therefore we can decouple all required broadcast and superposition constraints, if the power is in- 
creased by a factor of d, which is the maximum degree of any node. 

Thus given any cut in the polymatroidal network along with the assignments of the edges to broadcast or 
superposition constraints, we can obtain a similar cut in the Gaussian network by decoupling the constraints 
which are not active in the polymatroidal cut. This incurs a power penalty factor of d, thus as far as the outer 
bound is concerned, we can assume that each node has power dP instead of P. The network thus obtained is 
made of MAC and broadcast channels. In this network, every cut decomposes into the sum of MAC and BC 
cuts. A MAC cut with d nodes when evaluated under general distribution on the input, is of the form, 

Y^Rij < Elog ^l + (^|^|) 2 dP^ (140) 

< E log ^1 + ^2 \hij\ 2 d 2 P^j by Cauchy-Schwartz inequality (141) 

< log ^l+E(^|/i, i:) | 2 )d 2 pj by Jensen (142) 

= \og(l + d 3 P) (143) 

< E {log (1 + ad 3 P\h\ 2 )} =C {ad 3 P) (144) 

The last step follows because, 

E{log(l + c|/i| 2 )} = E{log(l + ce log|/l12 )} (145) 
> log(l + ce E ( 1 °sl /l l 2 )). (146) 

Here a := e -E ( log I' 1 ! 2 ) is finite for the fading distribution, by assumption (it is equal to . Thus, the cutset 
bound for the original network implies that 

J^Rij < C(ad 3 P), (147) 

i 

whereas the corresponding cutset bound for the polymatroidal network is of the form, 

Y^Rij < \C{2P). (148) 



We get, 



wg? ^ \n™r* (^) , (149) 



where b := | = ^'"l^'' 1 ^ is a constant depending on the fading distribution. For h distributed as complex 
gaussian, b ps 0.86. 
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G Proof of Theorem M 



The k sources are in the layer Vq, and the k destinations are in the layer Vl+x- Let the number of nodes in the 
i-th layer be rtj. 



G.0.2 Max-Flow Rate 

Let Ri be the rate between the i-th source destination pair. We will route the flow in a symmetric manner, 
where the incoming flow is divided equally among all the edges going out of a node. We will compute 
constraints on the rate region achievable by this strategy. 

In the first hop, all edges going out of the i-th source will carry a flow of value The constraint imposed 
by the edges going out of the source is given by 

n 

—m<l <^ Ri<l, Vi = l,2,..,&. (150) 
The constraint imposed by the edges coming into the nodes of the first hop are given by 



kri\ 



-k < 1 J2 Ri - 711 " (151) 



The total flow carried by all the nodes in any given layer equals ^ Ri, and each node in layer / carries a 
flow of corresponding to flow i. 

In the I — th hop connecting layers V/_i and Vi, each edge carries a flow of value ^ - corresponding 

to source i, which yields a total flow of value ^ — - for each edge. The outgoing constraints on layer / — 1 
yield, 



nini +1 

The incoming constraints on layer I yield, 



i^?i ni+l <l ^J2Ri<ni, Vt = l,2,..,fc. 



(152) 



nini + x 



ni<l <=► J2Ri<n l+1 , Vi = l,2,..,k. (153) 



In the final (L + 1-th) hop also, there are constraints similar to layer 1. In particular, the outgoing con- 
straints on layer I yield, 

^^k < 1 <^> V R L < n L . (154) 

n Lk 

and the incoming constraints on the destination layer yield, 

R 

-L nL <\ < 1, Vi = 1,2,.., A;. (155) 

m 

Thus a rate pair (R\, ...,Rk) is achievable by routing iff 

^Ri < mm(ni,ni +1 ) VZ = 0, 1, L. (156) 

i 

Ri<l Vi = 1,2, ...,k. (157) 
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G.0.3 Cut-set Region 



We can easily write the following constraints, which are a subset of the cut-set bounds. 

Corresponding to the cut separating = U l i=0 Vi and Q, c , the following constraint can be written, 

J^-Ri < mm(n h ni +1 ) V7 = 0, 1, L. (158) 

i 

Corresponding to the cut given by Q = si, we can get the following constraint 

Ri<l \/i = l,...,k. (159) 

Comparing (1156b . (11571 ) and (11581 ). dl59l ), we can deduce that any rate tuple that satisfies the cut-set region 
will lie in the rate region achieved by the flow. Thus the rate region corresponding to max-flow equals the rate 
region corresponding to cut-set region. 

For example, Fig. [6b] denotes the directed layered polymatroidal network obtained from the network in 
Fig. [6a] Every node basically constrains the total inflow and outflow to be lesser than 1. 
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